metal : reduce rset heartbeat from 500ms -> 5ms by ggerganov · Pull Request #24074 · ggml-org/llama.cpp

ggerganov · 2026-06-03T12:52:05Z

Overview

Make the heartbeat loop of the Metal backend (that keeps the residency sets alive) more frequent. This is helpful for reducing the time for exiting a ggml application on macos.

Requirements

I have read and agree with the contributing guidelines
AI usage disclosure: NO

* origin/master: (57 commits) server : disable on-device spec checkpoints (ggml-org#24108) arg: fix double mtp downloads (ggml-org#24128) webui: [a11y] fix keyboard navigation issues in chat interface and sidebar (ggml-org#23132) Move duplicated imatrix code into single common imatrix-loader.cpp (ggml-org#22445) ui: Fixed packages (ggml-org#24119) ui: added single line reasoning preview (ggml-org#23601) return filter to save memory (ggml-org#24125) convert: Fix Gemma 4 Unified conversion (ggml-org#24118) ggml: vectorize ggml_vec_dot_q4_1_q8_1 with WASM SIMD128 (ggml-org#22209) server: avoid unnecessary checkpoint restore when new tokens are present (ggml-org#24110) agents: refactor, include more guidelines (ggml-org#24111) webui: fix tool selector toggle/counter, key tools by stable identity (ggml-org#24065) build : use umbrella Headers directory for XCFramework module map (ggml-org#23974) server : add header to tools/server/server-http.h (ggml-org#24089) cmake: skip cvector-generator and export-lora when CPU backend is disabled (ggml-org#24053) fix(mtmd): handle Gemma 4 audio projector embedding size (ggml-org#24091) readme : add status badges (ggml-org#24104) tests : refactor test-save-load-state to accept token input (ggml-org#24073) metal : reduce rset heartbeat from 500ms -> 5ms (ggml-org#24074) ggml-webgpu: FlashAttention refactor + standardize quantization support (ggml-org#23834) ...

(cherry picked from commit 3d19986)

metal : reduce rset heartbeat from 500ms -> 5ms

a89f13a

ggerganov requested a review from a team as a code owner June 3, 2026 12:52

github-actions Bot added ggml changes relating to the ggml tensor library for machine learning Apple Metal https://en.wikipedia.org/wiki/Metal_(API) labels Jun 3, 2026

ggerganov merged commit 3d19986 into master Jun 4, 2026
26 checks passed

ggerganov deleted the gg/metal-reduce-rset-heart-beat branch June 4, 2026 05:05

jimbothigpen pushed a commit to jimbothigpen/llama.cpp that referenced this pull request Jun 6, 2026

metal : reduce rset heartbeat from 500ms -> 5ms (ggml-org#24074)

f2b6dfd

(cherry picked from commit 3d19986)

jimbothigpen pushed a commit to jimbothigpen/llama.cpp that referenced this pull request Jun 6, 2026

metal : reduce rset heartbeat from 500ms -> 5ms (ggml-org#24074)

e78be6d

(cherry picked from commit 3d19986)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

metal : reduce rset heartbeat from 500ms -> 5ms#24074

metal : reduce rset heartbeat from 500ms -> 5ms#24074
ggerganov merged 1 commit into
masterfrom
gg/metal-reduce-rset-heart-beat

ggerganov commented Jun 3, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

ggerganov commented Jun 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Overview

Requirements

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

ggerganov commented Jun 3, 2026 •

edited

Loading